Relative Functional Comparison of Neural and Non- Neural Approaches for Syllable Segmentation in Devnagari TTS System. Prof Mrs
نویسندگان
چکیده
This paper presents methods for automatic speech signal segmentation using neural network. Speech signal segmentation is carried out to form syllables. Syllable is a common unit for concatenative TTS systems. Concatenative TTS being using speech segments of recorded speech is natural as compare to Formant or Articulatory TTS systems. This TTS stores small segments of speech and join them together to form new word. This helps to generate more number of words based on very small database. As manual segmentation is very time consuming and it has certain limitation on naturalness, some neural network models are used to improve naturalness of resulting segments in speech synthesis. The proposed work explains how neural network approaches like Maxnet, K-means outweighs in performance than traditional non neural approaches like slope detection and simulated annealing. About more than 90% accuracy is achieved with neural network models for syllable segmentation which resulted in naturalness improvement of Marathi TTS.
منابع مشابه
An Automated MR Image Segmentation System Using Multi-layer Perceptron Neural Network
Background: Brain tissue segmentation for delineation of 3D anatomical structures from magnetic resonance (MR) images can be used for neuro-degenerative disorders, characterizing morphological differences between subjects based on volumetric analysis of gray matter (GM), white matter (WM) and cerebrospinal fluid (CSF), but only if the obtained segmentation results are correct. Due to image arti...
متن کاملکاهش رنگ تصاویر با شبکههای عصبی خودسامانده چندمرحلهای و ویژگیهای افزونه
Reducing the number of colors in an image while preserving its quality, is of importance in many applications such as image analysis and compression. It also decreases memory and transmission bandwidth requirements. Moreover, classification of image colors is applicable in image segmentation and object detection and separation, as well as producing pseudo-color images. In this paper, the Kohene...
متن کاملGlobal Syllable Vectors for Building TTS Front-End with Deep Learning
Recent vector space representations of words have succeeded in capturing syntactic and semantic regularities. In the context of text-to-speech (TTS) synthesis, a front-end is a key component for extracting multi-level linguistic features from text, where syllable acts as a link between lowand high-level features. This paper describes the use of global syllable vectors as features to build a fro...
متن کاملNeural Network Approach for Herbal Medicine Market Segmentation
Market segmentation is the start point of executing targeted marketing strategy. This study aims to determine fit dimensions and appropriate specifications for the segmentation of herbal medicines market in order to provide production and market departments with fit strategies by identifying the profile of the market customers and recognizing their differences in the identified indices. This is...
متن کاملP63: Automatic Detection of Glioblastoma Multiforme Tumors Using Magnetic Resonance Spectroscopy Data Based on Neural Network
Inflammation has been closely related to various forms of brain tumors. However, there is little knowledge about the role of inflammation in glioma. Grade IV glioma is formerly termed glioblastoma multiform (GBM). GBM is responsible for over 13,000 deaths per year in the America. Magnetic resonance imaging (MRI) is the most commonly used diagnostic method for GBM tumors. Recently, use of the MR...
متن کامل